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We explore how the social dynamics of communication and learning can bring about the rise of 
a syntactic communication in a population of speakers. Our study is developed starting from a 
version of the Naming Game model where an elementary syntactic structure is introduced. This 
analysis shows how the transition from non-syntactic to syntactic communication is socially favored 
in communities which need to exchange a large number of concepts. 
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I. INTRODUCTION 



The study of the evolution of languages and their struc- 
tures has generated a very rich debate crossing different 
disciplines and approaches. 

The ideas developed in the linguistic community, which 
introduced the thesis of considering some linguistic struc- 
tures as innate with some specific properties genetically 
encoded in a language module or organ [1] have been 
very prolific. They bootstrapped the development of 
many works where pure evolutionary perspectives are in- 
troduced to explain the generation of languages. Here, 
the dominant paradigm is the Darwinian evolution of bi- 
ological systems. The description of language evolution 
is based on a biological dynamics constructed above the 
concepts of natural selection, mutation, and fitness, elab- 
orated in terms of communication success [2]. This ap- 
proach is particularly well suited to describe evolution 
from a functionalist perspective, where the category of 
utility is the one that drives the dynamics. In any event, 
many linguistic properties appear to be so highly ab- 
stract as to even hinder communication [3]. This means 
that they are quite difficult to be introduced on purely 
functional basis and they can not be explained merely 
in terms of communicative effectiveness or cognitive con- 
straints. Moreover, it is hard to explain how shifts from 
learned linguistic conventions can be fixed into geneti- 
cally encoded principles necessary to evolve a language 
module. Cultural conventions change much more rapidly 
than genes and the Baldwin effect, a possible Darwinian 
solution to this challenge, can not be the solution to this 
puzzle [4] . Biological models can be seen more as a pow- 
erful metaphor for studying the effects of random copying 
and selection, but more specific mechanisms, typically re- 
lated to cultural transmission, should be considered. 

Recently, attention was paid to defining specific cul- 
tural dynamics, directly related to linguistic ability. The 
mechanisms which define the dynamics of the evolution 
of language are different from those underlying biolog- 
ical evolution. Language is transmitted among people 



through learning and not DNA. It is shaped by processes 
of cultural transmission across generations of language 
learners. In this view, linguistic constructions are not 
innate, but rather they are acquired through some form 
of probabilistic learning. This learning process, articu- 
lated on the use of cognition-general principles, has be- 
come the central issue governing language evolution. In 
fact, learning defines the dynamics of linguistic variants 
and the differences among language learnability control 
these dynamics. Learnability is quantified measuring the 
learner's capacity to recover a complete description of a 
linguistic construction to which she/he has been exposed 
sufficiently [5]. Several works have studied the differen- 
tial learnability of competing linguistic variant [6,7], and 
also their dynamics in the absence of selection [8]. 

Biological dynamics and cultural dynamics of learn- 
ing are two central issues which determine the creation 
of hnguistic structures, but they are not the only ones. 
Language is constructed for communication. It is not 
only the basis for social relationships, but it is also based 
on social relationships. Individual learning is just one 
aspect of a more general and collective process. The 
fixation of linguistic conventions among a population of 
speakers is another dynamics related to the linguistic def- 
inition, and the structures which appear to be learnable 
at an individual level must be fixed socially. These social 
dynamics cause a pressure on language, which shapes a 
shared communication system. This is a form of collec- 
tive learning. It is important to perceive how on a social 
level even completely arbitrary linguistic properties can 
succeed. In fact, if the same convention is adopted by all 
members of a community this convention can work and 
finally it becomes fixed. The only important fact is that 
everyone adopts the same set of culturally mediated con- 
ventions. Even different conventions, if equally effective, 
may serve equally well if there are no costs or no con- 
flicting functional pressures. Fixation of structures is not 
driven by a fitness or a learnability advantage, but rather 
by the mechanisms which generate consensus about the 
linguistic elements used by speakers. This process is not 
necessarily a functional process and is not only driven 
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by utility. A lot of works related to this social aspect 
of language appeared recently, following a seminal paper 
by L. Steels [10]. These ideas were first used to describe 
the birth of neologisms, and they have been tested by an 
artificial experiment in which embodied software agents 
bootstrap a shared lexicon without any external interven- 
tion [9]. Robots concretize a language game developing 
a vocabulary throughout a self-organized process called 
Naming Game. Recently, these studies have also at- 
tracted the interest of the Statistical Physics community. 
An initial study in this direction [11], was inspired by 
experiments conducted with the use of robots [10]. In 
that work each player is characterized by an inventory 
of words which can all be used to name the same object. 
At each time step two players, randomly chosen, interact 
following some simple rules. These dynamics force the 
system to undergo a disorder /order transition towards 
an absorbing state characterized by a common word used 
by all the players. These ideas have been developed to 
describe other phenomena, such as the emergence of 
universality in color naming patterns [12] and the self- 
organization of a hierarchical category structure in a 
population of individuals [13]. 

The ideas and considerations we have exposed can be 
applied across different linguistic levels: lexicon, phonol- 
ogy, morphology, and syntax. In this work, we will focus 
our attention on syntax. Syntax is a process to combine 
progressively symbolic units in an ordered output which 
falls within the quite narrow bounds that delimit human 
language. This is obtained by merging words into larger 
units and superimposing algorithms that determine the 
reference of items that might otherwise be ambiguous or 
misleading [14]. As proposed in the Minimalist Program 
[15], the basic syntax-creating process is Merge, a process 
that takes two units (words, phrases, clauses) and forms 
them into a single one satisfying some constraints. This 
means that Merge has many restrictions on the items to 
be merged, and there is a consistent way of merging them. 
As we are interested in the transition from non-syntactic 
to syntactic communication it is reasonable to look for 
the simplest advance from the pre-syntactic (one-word) 
stage, even if it does not specifically correspond to the 
syntax of some present-day language. This first step can 
be identified with the most basic (proto-)syntactic com- 
bination: flat concatenation of two symbols, where all 
the possible combinations are functional [16]. This cor- 
respond to a purely linear bead-stringing process, a prac- 
tice which underlies protolanguages, like the one used by 
speakers of a pidgin language [17]. 

In our work we are interested in exploring the transi- 
tion from non-syntactic to syntactic communication from 
a social dynamics point of view. Directly following the 
ideas of Nowak et al. [18], the example that we are going 
to explore can be stated in this way. Let us consider the 
situation where a speaker is interested in communicating 
some concepts. If she/he uses a non-syntactic language 
a symbol (word) is used for each concept. In the case of 



a syntactic language a combination of two symbols, for 
example one for the object i and one for the action j, 
can be used to communicate the concept Cij [18]. In the 
following we will consider the simplified situation where 
the number of object and action exchanged in the com- 
munication are the same {S). Moreover, all the possible 
combinations of these symbols can occur and correspond 
to a meaningful concept. It follows that, in this model, 
the possible combinations can be represented by a matrix 
Cij = S X S (see Table I). For example, as a particular 
situation we can think that the lines elements represent 
nouns and the column elements verbs. A population 
of individuals coevolve this system of symbols, with or 
without syntax, by playing elementary language games 
(Naming Game) analogous to the ones introduced in 
[11]. In this way we can analyze the differences between 
syntactic and non syntactic communication and we can 
distinguish when the transition from non-syntactic to 
syntactic communication is socially favored. 





a 


b 


c 


d 


a+d 


b+d 


c+d 


e 


a-|-e 


b+e 


c+e 


f 


a-hf 


b+f 


c-hf 



TABLE 1. In a Naming Game with syntax for the commu- 
nication of 9 concepts we represent these concepts using the 
matrix C = 3X3. Each concept is specified by the couple 
formed by two different possible words contained in two dif- 
ferent inventories among the six inventories a,b, c, d, e, f . For 
example, in a concrete situation a, b, c can stand for an object 
(noun) and d, e, / for an action (verb). 
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The paper is organized as follows. The Section 11. A 
introduces a version of the basic Naming Game model 
for the communication of one concept and Section II. B 
illustrates the model with one concept and syntax. Sec- 
tion II. C describes the generalization of the model for 
many different concepts, in the case of a syntactic or non- 
syntactic communications. In Section III. A we show the 
numerical results obtained for a one concept model using 
syntax, or not, along the communications. Section III.B 
is devoted to illustrating what happens with the intro- 
duction of syntax in a many-concept Game. Conclusions 
are reported in Section IV. 



II. THE MODEL 
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FIG. 1. (Color online) Example of the dynamics of the in- 
ventories for a one concept model without syntax. On the left 
we present a successful Game, on the right a failed one. We 
represent the 32 possible symbols with only five slots for the 
sake of clarity. When a symbol is not present in the inventory 
the corresponding bit is set to 0, otherwise, when the symbol 
is present, the bit is set to 1. The shadowed elements are the 
ones transmitted by the speaker. 



A. The basic model for one concept 



The Naming Game is played by P agents who try 
to reach consensus in naming a single concept. An in- 
ventory, which contains an arbitrary number of words, 
represents each agent. Population starts with empty in- 
ventories. At each time step, two agents are randomly 
selected; the first one assumes the role of speaker, the 
second one of hearer. Then, the following microscopic 
rules [11], control their actions: 

1) The speaker retrieves a word from its inventory or, 
if its inventory is empty, invents a new word. 

2) The speaker transmits the selected word to the 
hearer. 

3a) If the hearer's inventory contains such a word, the 
communication is a success. The two agents update their 
inventories so as to keep only the word involved in the 
interaction. 

3b) Otherwise the communication is a failure. The 
hearer learns the word communicated by the speaker. 

The players invent new words choosing among 32 pos- 
sible ones with equal probability. In contrast with the 
classical implementation of the model [11] we work with a 
fixed maximum number of possible different words. This 
is the only difference between our implementation and 
the classical one. We introduce this simplification with 
the goal of implementing a light model that can be easily 
generalized for the description of the naming process for 
more than one concept. For this reason we need a fast 
algorithm. This is obtained by using boolean program- 
ming techniques which causes a veritable improvement in 
the computational times. An example of these dynamics 
is represented in Figure 1. 



B. The syntax model for one concept 

Now we consider a context in which one single concept 
is exchanged with the use of compositionality, a rudimen- 
tal form of syntax. In this situation, the concept is rep- 
resented by a couple of symbols ax-\- f^y, each one sorted 
from a different inventory and Ky. It follows that 
each agent is characterized by two different inventories 
Kx and Ky. At each time step, the following microscopic 
rules control the communication: 

la) The speaker retrieves a word {ax) from its inven- 
tory Kx] if its inventory is empty, the speaker invents a 
new word. 

lb) The speaker retrieves a word {^y) from its inven- 
tory Ky] if its inventory is empty, the speaker invents a 
new word. 

2) The speaker transmits the selected pair of words to 
the hearer. 

3a) If the hearer's inventories Kx and Ky contain the 
pair of words (ax^/Sy), the communication is a success. 
The two agents update their inventories so as to keep, in 
each one, only the correspondent words involved in the 
interaction {ax in Kx and (3y in Ky). 

3b) Otherwise the communication is a failure. The 
hearer adds the words he does not know (one or two) to 
the corresponding inventory /ies; the speaker does noth- 
ing. 

Here we hypothesize that the learning of segmented 
elements of the utterance is possible even if the com- 
munication is a failure. This idea is supported by the 
fact that it is not necessary to have any positive feed- 
back to identify the components of a speech. In fact, 
some popular experiments shown how very young infants 
can achieve the task of word segmentation of an utter- 
ance with only minimal exposure, just by exploiting the 
transitional probabilities between syllables [19]. Even so. 
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the ability to use exclusively statistical information com- 
ing from a passive exposure to process a given language 
stream seems to be confined to the individuation of the 
segments of a stream, but not to acquiring the general- 
ization correspondent to a syntax structure [20] . For this 
reason, we make the hypothesis that the fixation of the 
structured element olx + P>y is possible only if there is a 
communication success. Which means, on the basis of a 
exposure to a positive feedback. 



FIG. 2. (Color online) An example of the structure and 
evolution of the inventories for the model with syntax when 
four concepts (iS = 2) are exchanged. In this particular ex- 
ample the number of inventories (25') is equal to the number 
of concepts. We can think that the first two inventories rep- 
resent nouns and the third and fourth stand for verbs. The 
starred inventories are the ones corresponding to the concept 
sorted out in a specific communication event. The shadowed 
elements are the ones transmitted by the speaker. 



C. Many concepts games 



In this situation agents develop communications which 
can exchange C different concepts. If no syntax struc- 
tures are present, each concept is represented by one 
symbol. If syntax is introduced, a combination of two 
symbols, + each one picked up from a different in- 
ventory, represents each concept. As in the basic model, 
each symbol is represented by one of 32 different possible 
words. 

It follows that for a C concepts Game, if there is no 



syntax, each agent is represented by 



1,2,..,C 



inventories, each one containing no more than 32 words. 
These words are exchanged in the same way as in a single 
concept Naming Game without syntax. 

In the case of a syntactic communication, as explained 
in the introduction, the concepts Cij are represented by 
the elements, generated by the combination of two sym- 
bols, of a matrix Ci^ = SxS. It follows that a C concepts 
Game with syntax is obtained by introducing agents rep- 
resented by Kz, z = 2,.., 26' inventories, each one con- 
taining no more than 32 words. In Table I we give an 
example of a Game with nine concepts. At each time 
step a concept is chosen determining the two inventories 
that represent it (for example Ka and from which 
we represent the concept with the couple aa -\- f^e)- The 
dynamics of each single communication are the same as 
the one concept game with syntax (an example of a four 
concepts game is represented in Figure 2). 









io|i[g|oj 


|1|0|1|0|0| 




o|i|i|o|i|* 


p|l|0|0|Ok 




1|0|0|0|0| 


i|o|o|o|ol 




ilolilolil* 


|0|0|0|0|1|^ 


SPEAKER 





SUCCESSFUL COMMUNICATION 




SPEAKER 




UNSUCCESSFUL COMMUNICATION 



III. RESULTS AND DISCUSSION 



A. One concept model 



We describe the time evolution of our system look- 
ing at some usual global quantities [11,21,22]: the total 
number of words (TVtot) present in the population and 
the success rate (Rs)^ which measures the average rate 
of success of communications. This is obtained by eval- 
uating the frequency of successful communications in a 
given time interval. 

The basic model is a simplified version of the original 
Naming Game where the number of different words in- 
troduced in the system is limited and sorted among a 
cache of 32. As a result, a homologous behavior occurs 
with some limited differences (see also [21] for similar re- 
sults). In our version of the model the maximum number 
of words for each agent is constant and it does not scale 
with the square root of the population. This important 
fact is responsible for remodeling the temporal scaling 
behavior. Anyway, it is important to point out that, as 
the mean number of words for each player is always well 
below 32, the model maintains all the basic features of 
the original one and the possibility of the invention of 
new words is not affected. 

All agents start with an empty inventory and an ini- 
tial transient exists which corresponds to the rise of 
A^tot(^)-This stage finishes when this quantity attains a 
maximum value {Max[Ntot] ^ 3.8P) which is maintained 
along a plateau. When the redundancy of words reaches 
a sufficiently high level, the number of successful plays 
increases. The curve Ntot{t) begins a decay towards the 
consensus state, corresponding to one common word for 
all the players, reached at time T. In Figure 3 we report 
the temporal evolution for Ntot{t) and Rs{t). 
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FIG. 3. (Color online) One concept model. Top: Model 
without syntax. We present the temporal evolution for the 
total number of words divided by the total population. The 
inset shows the success rate Rs{t). Bottom: The same data 
for the syntactic model. All data are averaged over 1000 sim- 
ulations. 

As shown in [11], it is possible to estimate the maxi- 
mum number of total words using some simple analytical 
considerations. If we represent the mean total number of 
words for an agent, at time step with n(t), and the 
mean total number of different words with D{t) we ob- 
tain: 



n(t + l) -n{t) 



n{t)\ D(t)J2 



1 n{t) 



{n{t) - 1) 
(1) 



We are considering that the probability for the speaker 
to communicate a specific word is and the probabil- 
ity for the hearer to own that word is It follows 
that the first term represents the gain term for a failed 



communication (which increases n{t) by 1/2), and the 
second term represents the loss term (which decreases 
n{t) by n{t) — 1). We can use this equation for describ- 
ing the P dependence. If we assume that at the plateau, 
where we can consider n{t -\- 1) — n(t) = 0, n(t) scales as 
aP^ , and that I) ^ 32, we can write: 



1 



32^ 



1) 



(2) 



This equation reduces to oc P^, which forces /3 = 0. 
This fact implies that the number of total words for 
each player is not dependent on P. It follows that 
Max[Ntot] oc P, as can be seen in Figure 3. Equa- 
tion 2 can also be used to evaluate the exact numerical 
value of the plateau. It is sufficient to consider aP^ as 
a constant and the corresponding value is 4.25. If we 
take into consideration that our equations are a mean 
field approximation which does not account for the cor- 
relations builded up between the individuals' inventories, 
the value is comparable with the result 3.8 obtained by 
the simulations. 

We explored the behavior of the convergence time T^^ 
(the index 1 stays for a one concept play). As stated 
before, this is the time at which the system reaches the 
consensus state, corresponding to one shared word for 
all the players. We studied the dependence of T^^ on P 
averaging over different simulations obtaining, through- 
out a regression, the following dependence: T^g(P) ^ 
-22.1P-f 7.6PlnP (see Figure 4). This result corre- 
sponds to the average convergence time over different 
simulations, which is obviously different from the con- 
vergence time of the mean simulation (the one presented 
in Figure 3). These analyses, as well as the following 
ones, are consistent for sufficiently large populations. 

We can support these numerical results with some an- 
alytical considerations analogous to the ones presented 
in [23]. During the time evolution we can distinguish 
two periods. A first interval, between t = and the time 
when the system reaches the maximum number of words 
{tmax)i which clearly scales linearly with the population 
size (see Figure 3). A second interval, between tmax and 
T^g, which is governed by the following dynamics. To 
reach convergence, the mean number of words for each 
individual, which does not depend on P, has to decrease 
to 1. As at each play the loss term does not depend on P, 
from the definition of the dynamics of our model a nec- 
essary condition for convergence is that each agent must 
win at least once. For this reason, near convergence, the 
number of agents which did not have a successful interac- 
tion (P*) should be finite. We can estimate this number. 
In fact P*(t) = P(l - Rs{t)/P)\ where 1/P is the prob- 
ability of selecting an agent and Rs(t) corresponds to 
the probability of a success . As can be appreciate in the 
inset of Figure 3, Rs{t) does not depend on P and it is 
practically constant for a long time after tmax- It follows 
that (1 - 1/P)^^^// (X 1/P, where td^ff = T^^ - tmax- 
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For large P we obtain: tdiff cx Plog(P). This condition 
turns out to be sufficient when confronted with the nu- 
merical data. In fact, T^^ results to be very well fitted 
by a function of the type ciP + C2PlnP. 



We can unfold a similar analysis for the syntactic 
model with one concept. In this case we are implement- 
ing a Game where each agent is represented by two inven- 
tories which follows the rules presented in the previous 
section. From the results of our simulations we can ob- 
serve that the crucial features that determine the scaling 
properties of the convergence times are also maintained 
in this scenario. In fact, as can be seen in Figure 3, 
Max[Ntot] P, tmax scalcs linearly with P and Rs{t) 
continues independent of P. As the same arguments pro- 
duced for estimating T^^ continue to be valid, we can 
expect the dependence on P to have the same func- 
tional form of T^g. Fitting our numerical data with such 
function (see Figure 4) we obtain the following scaling 
relation for the convergence time for a syntactic model 
with one concept: T}{P) ^ -213.2P + 52.7PlnP. 




FIG. 4. We present the convergence time as a function of 
the population size for the one concept model. Top: The 
model with no syntax is well fitted by the depicted relation: 
T^s ~ -22.1P + 7.6PlnP. The model with syntax is well 
fitted by the relation: ^ -213. 2P + 52.7PlnP. Bottom: 
Rescaled convergence times for the model without syntax (on 
the left) and with syntax (on the right). The rescaled times 
are well fitted by the function ai + a2 In P (continuous lines). 
A power law fit of these data (^iP^^), represented by the 
dashed lines, turns out to be moderately less accurate than 
the previous one, which can be derived from some theoretical 
considerations. 




FIG. 5. (Color online) The convergence time as a func- 
tion of S for different populations. The linear regres- 
sions are very well approximated by the general expression 
TJ^* ^ 1.5ST}{P) (the continuous lines). The inset shows the 
convergence time as a function of P for different S values. The 
convergence times for 5* = 4, 9 are rescaled by the factor 1.5S. 
The continuous lines are the rescaled curves ciP + C2PlnP 
obtained from the regressions. The rescaling transformation 
produces a good collapse of data and curves. Data are aver- 
aged over up to 1000 simulations. 



B. Many concepts model 



First, it is useful to introduce a new quantity, the total 
convergence time T^^*. This is the time when the system 
reaches the consensus state, corresponding to one shared 
expression for each concept and for all the players. Equiv- 
alently, it is the time when every communication event, 
relative to any one of the possible concepts, is a success. 

Given an experiment set up so that C concepts are ex- 
changed, if there is no syntax, the behavior described for 
the basic Game with no syntax is reproduced for each one 
of the concepts, and so it is easily generalized. In fact, 
the dynamics of each concept is obviously independent 
of those of the other concepts. This fact implies that if 
there is no syntax Tj/* = CTi(P) = S^TUP)- 

This is not the case if we introduce syntax. In this case, 
as we stated before, we consider the situation in which 
the matrix Cij = S x S represents all the exchanged con- 
cepts. We explored numerically the behavior of the total 
convergence time as a function of the number of concepts. 
We assume as a null hypothesis that Tj^* depends lin- 
early on the dimension of the matrix Cij. As presented 
in Figure 5 the analysis of the data from simulations sug- 
gests that this scaling relation is satisfied. Moreover, for 
5 > 2, we can express the regression coefficients for dif- 
ferent population sizes with a simple expression which 
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uses T}{P): TJ^* ^ 1.5ST}{P). This expression is de- 
rived from a numerical analysis of the data obtained from 
simulations. Some results supporting this scaling are re- 
ported in Figure 5. 



Starting from the scaling relation for the convergence 
time in dependence on P and S we can determine the dif- 
ferent behavior generated by the introduction of syntax. 
Quantifying this convergence time allows us to determine 
the strategy that enables a more effective communica- 
tion. In fact, reaching consensus at a collective level cor- 
responds to an efficient communication at an individual 
level. Depending on the number of concepts exchanged 
and on the population size we can determine if the total 
convergence time for a model with syntax is shorter than 
that for a model without syntax. This situation is at- 
tained if l.bST}{P) < S'^T^^{P). Using this estimation 
we are able to determine a critical value of S for which 
the emergence of syntax is viable: S > 1.5Tg{P)/T^g{P). 




FIG. 6. (Color online) Temporal evolution of the suc- 
cess rate for a different number of exchanged concepts (C) 
for a model without syntax or with syntax. For C > 81, 
TJ^* < C7T^3, which means a faster convergence for popula- 
tion using syntax. P = 25000 



IV. CONCLUSIONS 



From this relation it follows that, if the number of ex- 
changed concepts is sufficiently large in relation to the 
population size, the syntactic model is able to generate a 
faster convergence towards consensus. The dependence 
on the population size is very weak. For a population of 
2000 individuals S = 8 is sufficient for the conventional- 
ization of syntax, and for a population 100 times larger 
it is sufficient to select 5* = 10. So, from an empirical 
point of view, for typical populations, the relevant factor 
is simply the number of exchanged concepts, an interest- 
ing fact that enhances the possibility of syntax to emerge 
as an auto-organized process. The results obtained using 
our approximation were confirmed by different numeri- 
cal simulations. In Figure 6 we present an example for 
P = 25000. In this case the matrix dimension should 
be bigger than 8 and effectively, from our simulation, if 
we exchange 81 concepts {S = 9) the syntactic model 
clearly performs better than the non syntactic one. In 
others words, the introduction of syntax generates a so- 
cial communicative advantage when language must cope 
with a lot of concepts and when it is employed in smaller 
communities. In this context, the transition from non- 
syntactic to syntactic communication is socially favored. 



We formulate a simple framework to explore the possi- 
bility of the emergence of an elementary syntactic struc- 
ture fixed by the social dynamics defined by communica- 
tion. We start with a version of the Naming Game model 
generalized for exchanging many different concepts. A 
simple syntactic structure is introduced in the form of a 
binary combination process and the algorithm for fixing 
this structure is inspired by some known results relative 
to individual learnability of linguistic structures. In this 
way we can analyze the transition between syntactic and 
non syntactic communication on the basis of the social 
communicative potential of a linguistic structure, and not 
on the basis of the individual fitness or the velocity of in- 
dividual learning. 

From the analysis of this model, we can show that, 
under certain conditions, syntactic communication can 
reach consensus more efficiently than non syntactic com- 
munication, even if the the task of fixing a syntactic 
structure is more difficult. We are able to evidence some 
critical values for the number of exchanged concepts in 
dependence on the population size for which the emer- 
gence of syntax is viable. 
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